study on unit-selection and statistical parametric speech synthesis techniques

نویسندگان

mohammad savargiv

faculty of computer and information technology engineering, qazvin branch, islamic azad university, qazvin, iran azam bastanfard

faculty of media engineering, islamic republic of iran broadcast university, tehran, iran

چکیده

one of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. speech synthesis is granting human abilities to the computer for speech production. data-based approach and process-based approach are the two main approaches on speech synthesis. each approach has its varied challenges. unit-selection speech synthesis and statistical parametric speech synthesis are two dominant speech synthesizer techniques. the naturalness is the main challenge of all speech synthesis approaches. the intonation, speech style and emotional state are included in naturalness factor and all of them are considered as suprasegmental features. equipped synthesized speech with paralinguistic information is more believable from the perceptual aspect. prosody information plays an important role on the synthesized speech quality of text to speech systems. the first purpose of modern speech synthesizer systems is text to speech conversion and the second purpose is transferring the emotional states of text in the voice form. in this paper two main speech synthesis approaches and their challenges are investigated in detail.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech

We have applied two state-of-the-art speech synthesis techniques (unit selection and HMM-based synthesis) to the synthesis of emotional speech. A series of carefully designed perceptual tests to evaluate speech quality, emotion identification rates and emotional strength were used for the six emotions which we recorded – happiness, sadness, anger, surprise, fear, disgust. For the HMM-based meth...

متن کامل

Statistical Modeling for Unit Selection in Speech Synthesis

Traditional concatenative speech synthesis systems use a number of heuristics to define the target and concatenation costs, essential for the design of the unit selection component. In contrast to these approaches, we introduce a general statistical modeling framework for unit selection inspired by automatic speech recognition. Given appropriate data, techniques based on that framework can resu...

متن کامل

Unit Size in Unit Selection Speech Synthesis

In this paper, we address the issue of choice of unit size in unit selection speech synthesis. We discuss the development of a Hindi speech synthesizer and our experiments with different choices of units: syllable, diphone, phone and half phone. Perceptual tests conducted to evaluate the quality of the synthesizers with different unit size indicate that the syllable synthesizer performs better ...

متن کامل

Unit size in unit selection speech synthesis

In this paper, we address the issue of choice of unit size in unit selection speech synthesis. We discuss the development of a Hindi speech synthesizer and our experiments with different choices of units: syllable, diphone, phone and half phone. Perceptual tests conducted to evaluate the quality of the synthesizers with different unit size indicate that the syllable synthesizer performs better ...

متن کامل

Statistical parametric speech synthesis for Ibibio

Ibibio is a Nigerian tone language, spoken in the south-east coastal region of Nigeria. Like most African languages, it is resource-limited. This presents a major challenge to conventional approaches to speech synthesis, which typically require the training of numerous predictive models of linguistic features such as the phoneme sequence (i.e., a pronunciation dictionary plus a letterto-sound m...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of computer and robotics

جلد ۷، شماره ۱، صفحات ۱۹-۲۵

کلمات کلیدی

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023